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1.  INTRODUCTION 


Background 

The  relation  between  search  theory  and  information  theory  has  had  a thorny  history. 

A 1967  view  was  summarized  concisely  by  Koopman  (reference  1):  "Ever  since  the 
mid -nineteen -forties  when  the  theories  of  information  and  of  search  became  subjects 
of  general  interest,  attempts  have  been  made  to  apply  the  theory  of  information  to  prob- 
lems of  search.  These  have  proved  disappointing;  neither  the  formulas  nor  the  concepts 
of  the  former  theory  have  found  a place  in  clarifying  the  problems  of  the  latter."  These 
views  doubtless  reflected  much  of  Koopman's  own  research  on  the  subject. 

An  independent  contribution  to  the  generally  pessimistic  view  of  the  matter  was  the 
1961  paper  by  Mela  (reference  2).  He  presented  numerical  examples  for  simple  search 
models,  demonstrating  that  a search  policy  that  is  designed  to  maximize  detection  prob- 
ability does  not  necessarily  maximize  either  the  gain  in  expected  information  or  the 
probability  of  correctly  committing  forces  on  the  basis  of  search  outcomes.  Mela  con- 
cluded: "...it  does  not  seem  likely  that  there  is  any  intimate  connection  between  search 
theory  and  information  theory."  And:  "...search  theory  should  be  considered  in 
connection  with  the  general  theory  of  statistical  decisions  rather  than  with  information 
theory." 

This  negative  view  was  reinforced  by  a later  paper  by  Pollock  (reference  3)  in 
1971,  He  presented  yet  another  numerical  example,  in  which  different  search  policies 
were  necessary  to  maximize  either:  the  probability  of  detection  as  a result  of  the 
search,  or  the  probability  of  guessing  the  position  of  the  target  after  the  search,  or 
the  information  gained  during  the  search. 

These  negative  findings  had  a clearly  inhibiting  effect  on  research,  and  relatively 
little  effort  has  been  devoted  to  the  connections  between  information  and  search  for  the 
past  fifteen  years.  Nonetheless,  the  intuitive  appeal  of  information  theory  remains  strong, 
and  more  recently  the  tide  of  pessimism  has  been  stemmed  somewhat.  In  1973,  for 
instance,  Richardson  (reference  4)  used  Monte  Carlo  simulation  to  explore  alternative 
surveillance  policies  in  a false -target  environment.  Target  motion  based  on  a Markov 
process  was  also  included.  Richardson's  policy  options  included:  (a)  an  optimal  single- 
stage  look-ahead  policy  (about  the  same  as  maximizing  the  probability  of  a right  guess 
after  every  search  stage);  (b)  a policy  to  maximize  information  gain;  (c)  a policy  to 
search  the  cell  with  the  highest  a priori  probability;  (d)  a uniform  surveillance  policy 
(cycling  systematically  through  all  cells), 

Richardson's  results  showed  that  the  maximum -Information -gain  policy  was  generally 
(but  not  uniformly)  best.  In  his  summary,  he  said;  "The  principal  conclusion. . .is  that 
the  maximum  Information  gain  policy. . .appears  to  have  desirable  characteristics  in  the 
idealized  surveillance  scenario  considered.  Among  these  characteristics. , .are  good 
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initial  behavior  in  the  early  stages,  ami  good  asymptotic  behavior  in  the  later  stages. 

The  initial  beliavior  is  measured  principally  by  comparison  with  the  optimal  single-stage 
look-ahead  policy. . .which  is  designed  to  be  good  in  the  early  stages.  The  asymptotic 
behavior  is  measured  principally  by  comparison  with  the  uniform  surveillance  policy, 
which,  for  a stationary  target,  is  guaranteed  to  converge  to  1. . . . " 

Finally,  Inspired  by  Richardson's  simulation  findings,  Barker  (reference  5)  proved 
these  two  theorems  analytically: 

"I:  Suppose  that  the  detection  function  is  given  by  b(u)  = 1 - e U for  u * 0. 

Then,  subject  to  a constraint  on  total  search  effort,  the  allocation  of  search  effort 
which  maximizes  the  probability  of  detection  also  maximizes  the  entropy  of  the  posterior 
distribution. " 

And  the  converse: 

"II:  Let  b be  a regular  detection  function.  Suppose  that  for  every  prior 
target  location  distribution  and  for  every  constraint  on  total  search  effort,  any 
allocation  of  search  effort  which  maximizes  the  probability  of  detection  also 
maximizes  the  entropy  of  the  posterior  distribution.  Then,  b(u)  = 1 - e "au  for 
some  positive  constant  a." 

These  findings,  taken  as  a whole,  appear  to  be  inconsistent.  In  particular,  Mela's 
numerical  example  seems,  at  least  superficially,  to  be  in  clear  contradiction  to 
Barker's  theorems. 

If  science  teaches  us  anything,  it  should  be  that  where  contradictions  abound, 
there  lies  a fertile  field  for  research.  Accordingly,  this  paper  reexamines  the  relation 
of  search  theory  to  information  theory  and  and  attempts  to  reconcile  past  results,  to 
clarify  the  current  state  of  knowledge,  and  to  remove  some  impediments  to  further 
work  on  the  subject. 

Plan  of  the  Paper 

Section  2 establishes  the  mathematical  framework  of  the  problem.  A model  for  tho 
search  process  is  constructed.  A description  of  the  problem  in  terms  of  Information 
theory  Is  provided,  and  distinctions  are  drawn  among  several  possible  definitions  of 
information.  These  are  then  related  to  quantities  discussed  In  the  earlier  literature, 

Section  3 reexamines  the  examples  of  Mela  and  Pollock  and  extends  their  results 
numerically  to  the  case  of  a large  amount  of  search  effort.  Section  4 treats  analytically 
the  case  when  a large  search  effort  Is  applied:  section  5 treats  analytically  the  case  in 
which  the  search  effort  Is  infinitesimal. 
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Section  6 summarizes  the  findings  and  places  the  results  of  this  paper  in  a 
framework  with  past  work. 


2.  PROBLEM  STATEMENT 


The  Search  Model 

We  assume  a simple  model  for  the  search  process.  A single  target  is  in  one  of 
J cells.  Our  prior  knowledge  of  its  position  is  described  by  a set  of  probabilities, 

J 

p.»  j£  J>  P,  = 1*  We  conduct  a search  operation  in  which  various  amounts  of 
J J=1  J 

search  effort,  z^,  are  applied  to  each  of  the  J cells.  The  total  amount  of  search 
effort,  C,  is  fixed,  so  that  the  z^  amounts  of  search  are  subject  to  a constraint: 


j=l 


(2.1) 


For  our  general  treatment,  the  z^  need  not  be  quantized.  Special  cases,  for  which 

the  search  effort  is  applied  in  multiples  of  a fixed  unit  (the  "look"),  appear  in  the 
examples  of  Mela  and  Pollock. 

We  assume  that  the  detection  process  is  governed  by  an  exponential  detection 
function.  This  means  that  the  conditional  detection  probability  for  cell  j,  given  that 
the  target  is  in  cell  j and  that  an  amount  of  search  effort,  z , is  applied  there  is: 


Pr  (det  j | j # z j ) « 1 - exp(-a,z.)  . 
is  a characteristic  parameter  of  the  detection  process  in  cell  J. 


(2.2) 


This  formulation  subsumes  the  discrete -look  cases  of  Mela  and  Pollock.  If  is 
the  single-look  conditional  detection  probability,  then,  for  n looks  in  cell  j: 


Pr (det  j | j , n j ) = 1 - (l-q  ) 3 # 


(2.3) 


which  is  equivalent  to  eq.  2,2  when  n^  is  Identified  with  z.,  and  is  identified  with 
“In(l-qj)  . 

Finally,  we  assume  that  no  false  alarms  occur. 
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Within  the  framework  of  this  basic  model,  a specific  allocation  of  search  effort  may 
lead  to  J+l  possible  outcomes:  cither  a single  detection  in  any  of  the  J cells  or  no 
detections  at  all.  Because  we  have  assumed  no  false  alarms,  a detection  in  cell  k pro- 
duces a posterior  probability  distribution: 

p'  = 1;  p!  = 0,  j/k£J.  (2.4) 

k D 

4 

Failure  to  achieve  detection  in  any  cell  produces  a posterior  distribution: 

Pj  " Pj  exp("ajZj)/[k:51Pk  ^P^kV]  • (2.5) 

For  convenience  of  notation,  we  define 


CPj  = CP  (Zj 


Pj  exp(-ttjZj)  . 


(2.6) 


(These  cp^  functions  correspond  to  the  "failure  densities,"  /3(j),  used  by  Barker.) 
For  nondetection,  the  posterior  distribution  is: 


J 

P-5  = cpi/  S cpk 
3 3 k=l  * 


(2.7) 


The  unconditional  probability  of  detection  in  the  jth  cell  is,  in  this  notation: 

PD  . = p . (1  - exp  (-a  . z , ) ) = p , - cp  . , 

U * 3 J Fj 

and  the  overall  detection  probability  is: 

J J 

PD  = E PD.  = 1 - E cp,  . 

j«l  3 j-1  3 


(2.8) 


(2.9) 


The  Information  Description 


It  is  possible  to  define  many  "information"  quantities  related  to  search  operations. 
Values  for  self  information  and  mutual  information,  for  conditional  information  and 
unconditional  information,  for  information  as  a random  variable  and  for  ensemble 
average  information  --  any  of  these  can  be  computed  at  each  step  of  a search.  To  avoid 
ambiguity,  we  proceed  with  caution  in  defining  the  specific  "information"  we  wish  to 
discuss  in  this  paper  and  in  relating  that  definition  to  the  quantities  discussed  in  the 
references. 
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First,  we  define  two  probability  spaces,  X and  Y , that  are  concerned, 
respectively,  with  the  true  position  of  the  target  and  the  observed  response  of  the 
search  sensor.  Let  the  event  x represent  the  presence  of  the  target  in  cell  j.  The 


space  X has  dimension  J,  and  the  discrete  event  probabilities  are  given  by  p (x  ) = 

x J 

P.t  jC  J.  Let  the  event  y be  the  null  response  to  search,  and  let  the  event  y re- 

J U K 

present  a detection  in  cell  k . The  event  probabilities  in  the  Y space  can  be  con- 
structed from  the  appropriate  probabilities  in  the  X space,  together  with  the  condi- 

J 


tional  detection  probabilities:  p (y  ) = V Pv  I IX|)  • In  our  present 

Y k j_2  Y | a k | j X j 

example,  the  dimension  of  the  Y space  is  J + 1;  it  could  be  larger  if  multiple 
detections  or  false  alarms  were  considered. 


Our  principai  concern  in  this  paper  is  the  ensemble  average  mutual  information 
between  the  X and  Y ensembles.  By  definition  (reference  6),  the  mutual  information 
between  the  events  x,  and  y,  is: 

* Ir 


j 


Xjk  = ln  C(Pxl 


(2.10) 


This  quantity  is  a random  variable  in  the  product  space  X • Y.  Its  ensemble  average 
Ie 


■jk  ' 


Pvlv(^  Iy^Pv^tJ  (PX|  Y(x.  |yk)  )/pv(x.)  . (2.11) 


>1  k=0  XIY  j k Y k 


X'”j 


To  compute  this  ensemble  average,  we  use  the  following  relationships.  Implicit  in 
all  of  these  is  an  allocation  of  search  effort,  (z^)  : 


Px<*j) 


p.  , the  prior  probabilities: 


(2.12) 


(y0>  =Scp, 

j=i  J 


from  (2.9); 


(2.13) 
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PY  (yk>  = P]<  " * from  (2.8);  k = 1,  . . . ,J  (2.14) 


J 

S' 


PX  | Y *■ x j * y0 ) " ? 


(2.15) 


PX|Y<Xjli,k)  ■ 6jk 


k - 1 , . . . , J (2. 16) 


The  latter  two  equations  are  derived  by  appropriate  applications  of  Bayes'  rule.  When 
these  are  substituted  into  eq,  2,11,  we  obtain: 


= -L  In  p. 

j = l 3 


r J 

- -S  cp. 

L 3 = 1 3 


In 


cp. 


J 

Sep, 

k®*l 


/ ^ 

<E 


(2.17) 


The  first  term  is  recognized  as  the  entropy  of  the  prior  ensemble,  X,  which  we 


will  designate  HQ.  By  a general  result  of  information  theory,  the  mutual 
between  two  ensembles,  X and  Y , can  be  written: 

I„  - H (X)  - H (X I Y)  . 


[Information 


(2.18) 


Thus,  we  can  identify  the  bracketed  term  in  eq.  2.17  as  the  conditional  entropy  of  X, 
given  Y.  For  reasons  discussed  below,  we  designate  this  as  H . Thus: 


= 


fP j In  cp. 


J 

VP 


j = l 


The  mutual  information  is  then: 


*E  " H0  - HE 


vk-1 


fPi 


In 


’ J 

E 

4-1 


<Pi 


(2.19) 


(2.20) 


For  a given  prior  distribution,  HQ  is  a constant;  Hg,  and  consequently  Ig  , are  func- 
tionals of  the  allocation  of  search  effort  [z^],  as  well  as  functions  of  the  prior  prob- 
abilities, pj,  and  the  set  of  detection  parameters,  . 
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We  will  eventually  wish  to  investigate  the  extrema  of  1^,  but  H^,  is  more  convenient 

to  work  with.  We  note,  therefore,  that  because  of  the  minus  sign  in  eq.  2.20,  a 

maximum  of  H„  will  mean  a minimum  of  I , and  vice  versa. 

h b, 

Throughout  this  paper,  we  will  discuss  the  ensemble  average  mutual  information, 

I_  and  the  conditional  entropy  of  X given  Y — H_  s H(X|  Y)  — almost  interchange- 

C/  C, 

ably.  The  reader  should  therefore  bear  their  relationship  (eq.  2.20)  in  mind  as  he 
proceeds . 


An  alternative  approach  is  to  consider  the  seif-entropy  of  the  posterior  ensemble. 
If  detection  does  not  occur,  posterior  probabilities  pj  are  as  given  by  eq.  2.5  or  2.7; 

we  can,  therefore,  define  a posterior  self-entropy,  conditioned  on  nondetection  of  the 
target,  as: 


Conversely,  if  a detection  occurs  (and  by  our  assumption  there  are  no  false  detec- 
tions) eq.  2.4  holds  for  the  posterior  distribution,  and  the  posterior  entropy,  condi- 
tioned on  detection  in  any  k C J,  is: 


hd  = 


-Pk 


In 


Pk 


V' 


jA 


Pj  m Pi 


(2.23) 


- -1  In  1 - (J-l)  0 In  0 = 0 . (2.24) 

By  computing  these  two  conditional  entropies  with  the  appropriate  probabilities  of 
occurrence,  we  can  obtain  the  expected  entropy  of  the  posterior  ensemble: 


PD  HD  + PND  HND 


" PD  HD 


u-pD) 


ND 
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(2.25) 


Using  equations  2.9,  2.22,  and  2,24,  we  find  an  expression  for  Hp  that  is  identical 

to  eq.  2.19,  The  two  approaches  are  thus  equivalent,  and  the  expected  entropy,  H , 

defined  in  terms  of  the  expectation  over  the  posterior  distribution,  is  the  same  as  that 
arising  from  computing  the  average  mutual  information  between  the  two  ensembles.  The 
latter  is  the  more  fundamental  approach,  however;  it  should  be  followed  In  more  complex 
cases  such  as  those  involving  false  alarms . 

Having  defined  our  basic  terms,  we  can  now  return  to  the  references  to  see  what, 
in  fact,  the  previous  authors  have  discussed.  None  of  those  authors  recognized  explicitly 
the  possibilities  for  confusion  arising  from  the  existence  of  various  information  and 
entropy  values.  Each,  however,  was  consistent  in  doing  calculations  with  the  specific 
value  he  felt  to  be  appropriate.  Both  Mela  and  Pollock  used  I . Richardson  did  his  cal- 

culatlon  with  and  inferred  max  1^,  based  on  min  H^;  despite  the  consistency  of  his 

calculations,  he  did  not  draw  a careful  enough  distinction  between  H„  and  H.,_  in 

E ND 

presenting  his  conclusions.  Barker's  theorems  refer  to  . 

Because  Mela  and  Barker  were  discussing  different  quantities,  no  inference  of 
direct  contradiction  can  immediately  be  drawn.  The  ramifications  of  their  findings 
will  now  be  discussed. 
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3.  THE  EXAMPLES  CITED  BY  MELA  AND  POLLOCK 


The  examples  devised  by  Mela  and  Pollock  have  two  features  in  common:  Both 
consider  discrete  looks  only,  and  both  choose  the  total  number  of  looks  to  be  small--no 
more  than  the  number  of  search  cells,  J . They  differ  in  that  Mela  considers  situations 
in  which  both  the  prior  probabilities  and  the  conditional  detection  probabilities  are 
identical  in  all  search  cells;  Pollock.,  on  the  other  hand,  treats  the  case  in  which  both  prior 
probabilities  and  conditional  detection  probabilities  vary.  Tables  1-3  list  the  particulars 
of  their  examples,  expressed  in  the  notation  of  this  paper. 

TABLE  1 

MELA'S  FIRST  EXAMPLE 


J = 2; 


Pl  = p2  = 

.5; 

al  = a2  = 

.693 

SEARCH 

POLICY 

*o 

;D 

1E 

* Ho  ' 

(1,1) 

.5* 

.347 

(2,0) 

.375 

.380* 

The  starred  entries  in  the  tables  indicate  the  respective  maxima  of  P and  I for 

u h 

the  various  search  policies.  It  is  obvious  that  the  same  policy  leads  to  maxima  of  P^ 
and  I in  Mela's  second  example,  and  that  different  policies  are  required  to  maximize 
P and  I in  Mela’s  first  example  and  in  Pollock's  example.  From  this  it  is  self-evident 

D Ej 

that  a given  search  policy  does  not  always  maximize  both  detection  probability  and  inform- 
ation gain  at  the  same  time . 

What  is  not  self-evident  --  yet  is  often  asserted  --  is  that  this  fact  further  implies 
that  there  is  no  necessary  connection  between  the  extrema  of  P_  and  Ip  . As  we  show 

u hi 

below,  this  assertion  is,  in  fact,  false.  The  existence  of  counterexamples,  based  on  a 
small  number  of  policy  options  and  a small  amount  of  search  effort,  shows  that  the 
connection  between  the  extrema  of  P and  I is  not  universal.  In  a wide  range  of 

1 J E, 
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circumstances,  however,  the  extrema  of  and  1^  do  indeed  depend  on  identical 
search  policies . 


TABLE  2 


MELA1 

S SECOND  EXAMPLE 

J “ 3 ; 

Pi  " 

p2  " P3  " 

1/3? 

al  = 

a2  “ a3  " 

.693 

SEARCH 

POLICY 

XE  " H0  " HE 

(1,1,1) 

.5* 

.549* 

(2,1,0) 

.417 

.541 

(3,0,0) 

.292 

.478 

TABLE  3 

POLLOCK'S  EXAMPLE 

J = 3; 

Pi  = -1' 

p2  = .3, 

p3  « .6? 

* + ®, 

a2  « .511, 

a3  - .357 

SEARCH 

POLICY 

^D 

IE  “ H0  HE 

(1,0,0) 

.1 

.14* 

(0,1,0) 

.12 

.07 

(0,0,1) 

.18* 

.04 

-11- 

We  examine  these  first  by  relaxing  the  limitation  of  a small  amount  of  search  effort 
and  by  examining  and  1^  as  surfaces  in  a (J-l) -dimensional  search  allocation 

space.  Although  Mela  and  Pollock  do  not  deal  with  H^,  we  shall  examine  that  quantity 

numerically  also,  as  a basis  for  analysis  relating  to  Barker's  work. 

For  Mela's  first  example,  we  consider  a large  number  of  discrete  looks,  C,  allocated 
between  the  two  cells;  m looks  are  applied  in  cell  1,  and  C-m  in  cell  2,  We  then 
calculate  P^,  H^,  and  as  functions  of  m for  0 fim  SC.  Analytically: 

PD  = 1 - [2"m  + 2"(C"m)]/2 

Hnd  = £[2"m  + 2"(C’m)]/23 

+ [[2“m  + 2“(C_m)]/2}"1  [ ( 3m  2 ) / 2 ] • 

{ (m+1) 2_m  + (C-m  + l)2“(C“m)] 


(3.1) 


(3.2) 


He  = [[2"m  + 2~  <c“m)  ]/2)  6n  [ 2-m  + 2~  (C"m)  ] 
+ [ ( Urn  2 ) / 2 ] • [m2“m  + (C-m) 2“ (C_m) ] 


(3.3) 


These  quantities  are  plotted  in  figures  1-3  for  the  case  C=30  . is  plotted  on 

a linear  scale  (figure  2);  and  Hp  are  plotted  logarithmically,  as  -log  (1-P^)  and 

log  Hol  respectively  (figures  1 and  3). 
b 

The  obvious  content  of  figures  1-3  and  their  supporting  computations  is  that  each 

of  the  three  quantities  --  P_,  and  --  las  a maximum  at  the  same  position, 

D ND  n 

m=C/2=15.  Equal  allocation  of  search  effort  to  the  two  cells  produces  the  maximum 
detection  probability,  in  keeping  with  Mela's  finding  for  C=2,  and  the  maximum  value 
of  as  predicted  by  Barker's  first  theorem.  The  surprising  finding  is  that  the 

expected  information  gain,  I„,  shows  a minimum  (max  H„)  for  equal  allocation  of 

b — b 

search  effort.  Though  this  is  consistent  with  Mela's  calculation,  it  could  not  be 
anticipated  on  the  basis  of  previously  published  results. 
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Detection  probability:  -in  ( 1— Pq) 


Conditional  entropy : 


We  use  a similar  approach  to  Mela's  second  example,  with  l looks  allocated  to 
the  first  cell,  m looks  to  the  second,  and  C-^-m  to  the  third.  The  expressions  for 
PQ  (i;n),  (£,m),  and  Ilg  (£,  m)  are  completely  analogous  to  those  of  equations 

3.1  - 3.3  (but  too  unwieldly  to  reproduce  here).  Figures  4-6  show  the  three-dimensional 

plots  of  P„,  H .,  and  H„,  for  C=30,  0<m,  l <30,  0<m+is30.  As  before,  P and  H 
D nd  h u a 

are  plotted  logarithmically;  the  scale  is  linear. 

Figures  4-6  give  a general  idea  of  the  shape  of  the  surfaces.  The  perspective  of 

the  figures  is  so  oblique,  however,  that  precise  conclusions  must  rest  on  the  back-up 

computations.  These  say  that  the  uniform  search  allocation  (10,  10,  10)  produces  the 

maximum  of  P_  and  H.tr>,  and  the  minimum  of  H_  (maximum  L,).  The  maxima  of 
D ND  h b 

P_  and  R.r,  are  both  local  and  global  maxima.  But  the  minimum  of  H is  a local 
D ND  b 

minimum  only.  Smaller  values  occur  on  the  boundaries,  at  (15,  15,  0),  (15,  0,  15), 

and  (0,  15,  15).  Again,  these  findings  are  consistent  with  Mela's  C=3  case  and  with 

Barker's  theorem.  But  they  are  significantly  different  from  Mela’s  first  example,  in 

that  the  uniform  allocation  now  produces  a maximum  information  gain  rather  than  a 

minimum  and  that  this  maximum  is  relative,  not  absolute. 


Pollock's  example  ts  modified  slightly  here  for  graphical  presentation.  The 
essence  of  Pollock's  basic  example  is  the  variation  of  both  a ^ and  p^  from  cell  to 

cell.  To  achieve  extreme  variation  from  cell  to  cell,  Pollock  chooses  a value  of  1 
for  qj,  the  conditional  detection  probability  in  the  first  cell.  The  assumption  q^  = 1 

is  equivalent  to  © -+  “ in  our  notation.  This  has  the  unfortunate  effect  of  introducing 


a singularity  into  our  formalism,  because,  in  computing  cp^  , we  are  faced  with  the 


fact  that: 


-al z L - 

lim  lim  p^e  ‘ _ r 


lim  lim  p-^e 


To  avoid  this  singularity,  we  take  q - .9(),  which  implies  = 4.605.  This 

constitutes  "Pollock's  Example -Modified, " which  is  presented  in  figures  7-9,  in 
a format  comparable  to  that  of  Mela's  second  example.  The  effect  of  the  cell-to-cell 
variation  is  preserved  by  the  modified  example,  while  the  added  complications  of  the 
singularity  are  circumvented, 


As  we  shall  see  in  a later  section,  the  cell-to-cell  variation  of  either  or  pi 
will  destroy  the  symmetry  of  the  surfaces  in  ( & m)  space.  Variation  of  alone  will 
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MELA'S  SECOND  EXAMPLE  - DETECTION  PROBABILITY  vs.  SEARCH  ALLOCATION 


MELA'S  SECOND  EXAMPLE  - EXPECTED  ENTROPY  vs.  SEARCH  ALLOCATION 


displace  the  relative  locations  of  the  stationary  points  of  P^,  and  H^,  and  can 

in  fact  change  the  whole  character  of  the  surface  of  U . 

i k 

These  points  are  not  all  , obvious  from  figures  7-9.  It  is  clear  that  the  symmetry 
has  been  destroyed  in  all  three  figures.  no  longer  has  a local  extremum  within  the 

surface.  A saddle  point  appears  at  (2,  10,  18),  but  the  extreme  values  of  H„  occur 

at  edg  points.  and  retain  local  maxima,  at  (2,  11,  17)  and  (1,  11,  18), 

respectively. 

The  different  positions  for  the  maxima  of  ? and  could  be  Inferred  from 

Barker's  second  theorem;  the  different  positions  for  the  maximum  of  and  the  mini- 
mum of  H are  consistent  with  Pollock's  findings.  But  the  totally  different  character 

of  the  H surface  (a  saddle  point  in  place  of  a local  extremum)  could  not  have  been 

E 

anticipated  from  published  work. 


The  numerical  experiments  presented  in  this  section  were  designed  to  be  suggestive, 
not  definitive,  and  to  point  the  way  to  fruitful  lines  of  analysis.  What  the  experiments  have 
suggested  can  be  summarized  concisely: 


a. 


When  p^  and  a ^ are  the  same  for  all  search  cells: 


?D>  Hnd,  and  (or  Ip)  al)  have  local  extrema  at  a common  point. 

The  local  extrema  of  P^,  and  are  maxima. 

D ND 

The  local  extremum  of  II  may  be  either  a maximum  or  a minimum. 


The  local  maxima  of  and 


M.  are  also  global  maxima,  but  the 
» «IU 


b. 


local  extremum  of  Hp  Is  not  necessarily  a global  extremum. 
When  the  p,  and  a.  differ  from  cell  to  cell: 

1 i 


Hp  does  not  necessarily  have  a local  extremum. 

Pp  and  have  local  maxima.  The  positions  of  their  respective 

maxima  are  close,  hut  not  identical, 

The  next  section  of  this  paper  consists  of  analysis  based  on  these  observations. 
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POLLOCK'S  EXAMPLE  - MODIFIED  EXPECTED  ENTROPY  vs.  SEARCH  ALLOCATION 


4.  ANALYSIS  — LARGE  SEARCH  EFFORT 


In  this  section  we  pursue  analytically  some  of  the  ideas  suggested  in  the  previous 
section  by  the  numerical  extension  of  the  Mela  and  Pollock  examples  to  the  limit  of 
large  search  effort.  The  mathematical  framework  is  that  established  in  section  2.  We 
take  the  three  functionals  of  the  search  allocation  {z^}  : P^,  defined  in  eq.  2.9;  H^, 

defined  in  eq.  2.22;  and  H_,,  defined  in  eq.  2.19.  We  seek  to  determine  the  extrema  of 

these  functionals,  subject  to  the  constraint  on  total  search  effort  (eq.  2.1). 


We  defer  temporarily  the  precise  specification  of  a "large  amount  of  search  effort." 
We  assert  that  for  C > C*,  for  some  C*,  the  conventional  calculus  of  variation  technique 
using  Lagrange  multipliers  to  incorporate  constraints  is  applicable  and  will  lead  us  to 
the  desired  extrema.  The  determination  of  C*  in  terms  of  the  p and  will  follow 

as  part  of  the  analysis. 


First,  it  is  important  to  note  that  the  three  quantities 


on  only  two  independent  functionals  of  the 


PD’  HND’  and  HE  dePend 


and 


J 

As  ^Ccpj(Zj)  Incpj(Zj) 


J 

B * LJ  cp  . (z  .) 
j-1  3 3 


Specifically: 


PD  = 


1 - B 

H„n  = In  B - A/B 
= B In  B - A 


ND 

H„ 


(4.1) 


(4.2) 


(4.3) 

(4.4) 

(4.5) 
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We  can  obtain  some  genera!  insights  using  this  simplified  representation.  Let  the 
operator  6 signify  constrained  variation;  that  is,  for  a function  f,  5f  refers  to  the  set 

J 


of  J quantities  df/dz 


j’ 


subject  to  C = 21 


With  this  notation: 


6Pd  » -6B 

6Hnd  = 6 A (-  (1/B)  ) + 6B  |^(1/B)  + (A/B2) 
6He  = -6A  + 6 B ( 1 + In  B)  . 


(4.6) 

(4.7) 

(4.8) 


such  that  5A  and  6b  are  both  zero,  then  all  three  functionals  P_,  H.,_,  and  H„  have 

U MU  ii 

extrema  for  that  particular  allocation  (z*)  . Conversely,  if  6a  and  6b  are  not 

J 

simultaneously  equal  to  zero,  then  extrema  of  the  three  functionals  will  occur  for  different 
search  allocations,  given  respectively  by: 

6B  = 0 

5 A - 6B(1  + (A/B)) : (6A  / 6B  / 0) 

6 A - 6B(1  + In  B) : (&A  / 6B  / 0)  . 

Returning  to  the  specific  functional  form  of  A and  B , we  find  that: 

6 A = (dcpk/dzk)  (1  + In  cpk)  - \ 

(4.12) 

= + 1°  cPj^  ” ^ 

for  all  k C J . X is  a Lagrange  multiplier.  Similarly: 

SB  = -akcpk  - \ . (4.13) 


(4.9) 

(4.10) 

(4.11) 
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If  6a  and  6B  are  to  be  simultaneously  zero,  we  must  have: 


+ In  cpk)  - ctjCp j (1  + In  cp^) 


ak^k  = ajCpj 

for  all  j,  k£J.  Eliminating  cp  and  cp  between  these  equations,  we  obtain: 

j k 


<4.14) 

(4.15) 


a.  = u.  , for  all  j,k  e J . 
] K 


(4.16) 


Thus,  6A  and  6B  are  simultaneously  zero  only  if  a is  the  same  for  each  of  the  j search 
cells  in  J. 


"When  all  the  a are  equal,  eqs.  4.14  and  4. 15  reduce  to: 

J 

cpk(l  + In  cpk)  = cpjd  + -*-n  Vj) 


and 


cpk  - cpj  , for  all  j,k  6 J . 
The  only  solution  for  these  sets  of  equations  is: 


(4.17) 


(4.18) 


cp.  = const  = K 


for  all  k £ J . 


(4.19) 


This  equation  cannot  be  satisfied  for  small  values  of  the  total  search  effort,  C.  By 
requiring  it  to  be  satisfied,  we  can  determine  C*,  the  lower  bound  on  the  search  effort 
for  which  this  method  of  analysis  is  valid.  Using  the  definition  of  cp^,  (eq.  2.6),  we 
infer  from  eq.  4,19: 


"alZl 


“u222 


Pj  * 


(4.20) 
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In  this  formulation  we  have  assumed  that  the  p 
order,  so  that  = Mjn  3 . From  eq.  4,20, 


J 


values  are  arranged  in  nonincreasing 
with  a = a,  : 

j k 


2^  = (1/a)  In  (Pj/Pj)  : 


1,2  --  J-l 


or: 


C*  - (1/a)  [ 13  In  Pj  - J In  p. 


j-l 


■min 


For  C >C*  , eq.  4,19,  leads  to 


(4.21) 


(4.22) 


«*  (1/a)  In  (p^/K)  . 


(4.23) 


Further,  because  of  the  constraint  on  the  , this  constant  K can  be  evaluated  in 
terms  of  the  total  search  effort;  thus: 


zk 


(C/J)  + (1/a) 


in  pk 


(l/J) 


(4.24) 


This  a well  known  result,  closely  related  to  that  originally  derived  by  Koopman, 
(reference  8)  and  expounded  in  Stone  (reference  9).  The  optimum  allocations  for  Mela's 
two  examples  follow  from  this  trivially. 


We  may  now  summarize  these  results  in  two  theorems: 

Theorem  I:  For  the  assumptions  of  our  model  (section  2,1)  with  exponential  detec- 
tion functions  and  with  C >C*.  the  allocation  of  search  effort  {z*}  that  leads  to  an 

extremum  of  P^  also  leads  to  an  extremum  of  only  if  the  values  are  the  same 

for  all  k C J.  (This  is  the  alternative  statement  of  one  of  Barker’s  results.) 

Theorem  II:  For  the  assumptions  of  our  model  (section  2,1)  with  exponential  detec- 
tion functions  and  with  C >C*,  the  allocation  of  search  effort  { z*  } that  leads  to  an 

extremum  of  F_  also  leads  to  an  extremum  of  H„  only  if  the  a,  values  are  the  same 
D E k 

for  all  k£  J,  (New  result.) 


The  allocation  {z*]  that  leads  to  simultaneous  extrema  of  all  three  functionals  P^, 
HND>  and  He  is  given  by  eq.  4. 24. 
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We  must  now  find  out  whether  the  extrema  thus  determined  are  maxima  or  minima. 
To  do  this,  we  examine  the  second  derivatives.  Let  the  second  constrained  variation, 

2 2 
denoted  by  5 f,  be  defined  by  the  quadratic  form  £ C.C.  O f/3z  5z  ),  subject  to 

i.j  J J 

c = y,z. . 

f J 

2 

To  compute  6 f,  it  is  more  convenient  to  include  the  constraint  explicitly,  rather 
than  use  the  Lagrange  multiplier  technique.  This  is  done  by  noting  that  only  J-l  of  the 
z are  independent.  The  first  J-l  of  the  cp  are  taken  as  explicit  functions  of  their 

J ■ J j-l 

respective  z , and  the  remaining  cp  is  taken  as  a function  of  C - £ z . In  com - 

J J j=l  ^ 

puting  derivatives  with  respect  to  a specific  z^,  therefore,  both  cp^  and  cpj  must  be 
considered. 

With  this  convention,  the  various  second  partial  derivatives  can  be  written,  after 
some  manipulation,  as: 


a2Vdzk  " “ [v^k  + aj£pj. 


a2pD/dV2s,  " " 


^^ND'^k  ~ <:L//  ‘j  ^ ^k^k  ” aJipJ^ 


(4.25) 


2 2 

1 1/  iLi  cp  j ) (akcpk  + aypj) 


(4.26) 


3 HND/dzk3ze 


2 

aJ<Pj 


J 


1 + G,n  cpj  - ( <p  j Hn  j ) / ( ,S  cp  j ) 


J 

/ ( S cpj 

j-l  3 


+ (-  ak<pk  + a^)  (-  + a^) 
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a2Vazk  = "^k  + + (1/^cPj)  (ak^k 


-a^k (In  cpk  - In  S <Pj) 


-ajtpj (In  tpj  - In  S 9j) 


(4.27) 


9 H^V**  = 


aj'Pj 


u 

1 + 2n  cp  j - Bn  ( Z)  «Pj) 


+ (-akCpk  + aj9j)  (-a^  + ajtPj) 


These  are  the  second  variations  evaluated  at  the  extrema,  where  the  relationships 
obtained  by  equating  the  first  variation  to  zero  have  been  used  to  simplify  the  expressions, 
especially  in  the  case  of  H^. 

For  the  case  discussed  above,  «*  a and  = K for  all  j,  the  second  variations 
simplify  to: 


62pd  = -a2KD 


(4.28) 


62Hnd  - -({a2)/J)D 


(4.29) 


52Be  = -a2K(l  - In  J)D  . 


(4.30) 


D is  the  residual  quadratic  form: 


D - 5 Cj  + ( s c . V 2 0 • 


j-1 


j-1  j; 
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Noting  that  K,  by  definition,  is  a positive  constant  between  zero  and  one,  we  have: 

Theorem  III:  For  the  assumptions  of  our  model  (section  (2.1))  with  exponential 
detection  functions , with  C > C*  and  ail  » equal,  the  extrema  of  and  are 

always  maxima.  The  extremum  of  I-I^  is  a maximum  for  ]=2  and  a minimum  for  all 

J > 2. 

This  result  clarifies  a great  deal  of  the  mystery  and  confusion  resulting  from  Mela's 
paper.  His  two  cases,  J =2  and  J=3,  gave  contradictory  indications  about  H and  its 

Ei 

relation  to  P^,  and  produced  the  false  impression  that  there  was  no  connection  between 
detection  probability  and  information  gain  I (Mela's  I is  equal  to  (In  J - H ) in  our 

£1  Iw  ii 

terminology).  What  we  have  shown  here  is  that  for  the  important  case  in  which  the 
conditional  detection  probability  is  uniform  throughout  the  search  area,  optimizing  the 
detection  probability  is  identical  with  optimizing  the  change  in  the  amount  of  information. 
Moreover,  In  all  Instances  except  the  anomalous  case  ]-2,  maximizing  the  detection 
probability  is  identical  with  maximizing  the  information  gain. 

The  procedures  leading  to  Theorems  I-I1I  guarantee  only  that  we  have  found  local 
extrema  of  the  expected  entropy  surface.  To  complete  the  analysis,  we  must  further 
investigate: 

- Whether  there  are  additional  stationary  points  on  the  surface. 

- Whether  there  are  in  general  points  on  the  boundaries  of  the  surface  for 
which  the  value  of  exceeds  the  value  at  the  local  extremum,  as 

suggested  by  the  numerical  results  In  section  3 . 

We  note  first  the  values  of  the  pertinent  quantities  at  their  local  stationary  points. 
Using  the  solution  (eq.  4.24),  we  find  that: 


J 

Max  ( P J ~ 1 - exp  [InJ  + (1/J)  H In  p , ] exp(-uC/J)  (4.31) 
D i=l  1 


Max  = lnJ 


Extr  [He] 


InJ  [exp (InJ  + (1/J) 


J 

^ In  p . ) ] exp  (-aC/J)  .(4.32) 

i=l  1 
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For  equal  prior  probabilities,  p^  --  1/J,  these  reduce  to: 


Max  [PD]  = 1 - exp(-aC/J) 


(4.33) 

(4.34) 

Extr  [I  J = Hn  - Extr  [ HR]  = - exp(-aC/J))  (4.35) 


Max  [Knd]  = InJ 


0 


To  check  for  the  existence  of  other  stationary  points  on  the  surface,  we  return 
to  eq.  4.11.  That  equation  can  be  put  in  the  form: 


J 

V' 


acpk  (In  cpk  - In  L'  opi)  = k 
or,  by  further  manipulation: 


" Vn1'k  - G , 

where 


J 


“ 9 1,/  ^ cp^  ? G » \/a  S m, 

* * j-1  3 j=l  j 

J 

The  y.  must  satisfy  £ Y,  = 1,  and,  fora  solution  to  exist: 

k---l 


(4.36) 


(4.37) 


(4.38) 


0 j£  0 s 1/e  , 


(4.39) 


as  can  be  seen  from  figure  10. 

The  nature  of  the  solutions  can  be  argued  from  the  shape  of  the  curve  in  figure  10. 

J 

Assume,  first,  that  C = 1/e  . Then,  Y = 1/e,  and  V Y.  = J/e , which  is  less  titan 

K A-  K 

ks=  1 J 

1 for  J=2 , and  greater  than  1 for  J greater  than  2.  To  drive  y toward  1,  we 

k-1  k 

decrease  C.  For  J - 2 , the  points  must  move  down  the  right  branch  of  the  curve, 


-31- 


increasing  V,  , until  Y = 1/2,  G = 1/2  In  2.  For  all  J > 2,  the  points  must  move  down 

K K 

the  left  branch  of  the  curve,  decreasing  Y^,  until  Y^  = i/J,  G = (In  J)/J.  These  repre- 
sent the  solutions  = K,  already  discussed  in  equation  4.19  and  those  which  follow. 


If,  however,  we  continue  to  decrease  G,  with  all  Y,  on  the  left  branch  of  the  curve, 

J J 

23  V,  approaches  zero  in  such  a way  that  for  some  G*,  we  can  again  make  23  = A 

k=l  K k«l 

by  switching  any  one  of  the  Y,  to  the  right  branch  of  the  curve , The  condition  for  this  is 

K 

cp  = K 1;  cp  = K2;  j 4 k ej,  for  any  specific  k.  Because  of  the  shape  of  the  curve,  Y,  ~ 1, 
K J k 

Y ~ 0,  and  thu3  K1  > > K2. 

Let  K1  = pK2,  and  n = J-l.  Then,  from  eq.  4.37: 


[p/(p+n)]  f>,  [p/(p+n)]  = [l./(p+n)]  in  [l/(p+n)]  (4.40) 


The  second  equation  for  determining  the  two  free  parameters,  K1  and  K2,  Is  the 
constraint  on  total  search  effort.  Using  the  definitions  of  the  cpfc  : 


aC/J  » (1/J) 


J 


L k-1 


Bn  - Bn  p 


- On  K2  . 


(4.41) 


Eq.  4.40  cannot  be  solved  for  p(u),  but  it  can  be  solved  for  n(p)  : 


n(p)  = - p + exp  [ (p  Lv  p)/(p-l)] 


(4.42) 


This  equation  is  plotted  in  figure  11,  with  lnp  as  a function  of  n.  For  large  p : 


n ~ On  p + (1/p)  [ ( In  p)^/'2  + Bn  p]  + 0 (p  2)  . (4,43) 


For  r.  a 8 (J  * 9),  n ~ln  p,  p~en  within  a few  percent  of  accuracy.  It  is  Important 
to  note  from  figure  11  that  this  type  of  solution  exists  only  for  n > e-1  (J  > e).  This  means 
that  for  J = 2,  the  only  solution  is  the  one  with  both  Y points  on  the  right  branch  of  the 
curve  10;  for  ] *3,  however,  the  two  classes  of  solutions  exist:  a single  solution  (Type  1) 
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1 

■H 


with  all  Y points  on  the  left  branch  of  figure  10,  and  J solutions  (Type  2)  with  any  one 
of  the  J v points  on  the  right  branch  of  the  curve  and  the  other  J-l  on  the  left  branch. 

There  is  an  additional  condition  on  the  amount  of  search  effort,  C,  however.  Just 
as  there  was  a C*  threshold  for  the  existence  of  the  Type  1 solution,  there  are  J 
additional  thresholds  for  the  appearance  of  each  Type  2 solution.  This  iB  best  seen 
when  the  search  allocatio'  solutions  in  the  asymptotic  limit  are  calculated: 


zk  a 

(C/J)  + 

(1/a)  U pk  - (1/aJ) 

j 

S fa  Pn  - 
1*1 

(J-l) 2/aJ 

(4.44) 

2j  = 

(C/J)  + 

(1/a)  in  Pj  - (1/aJ) 

J 

S to  p.  + 
4-1  * 

(J-l)  /aJ 

(4.45) 

Since  the  z k * 0,  eq.  4.44  provides  the  threshold  criteria  for  C: 

2 J 

aC*  ^ (J-l)  + E In  p.  - J In  p. 

k " j-l  j * (4.46) 


k “ 1 . . • J 

This,  together  with  eq.  4.22,  provides  the  values  of  C for  which  each  of  the  J+l 
stationary  points  of  the  surface  first  appears.  In  the  case  of  equal  prior  probabilities, 

C*=0  from  (4.22),  and  aC*  equals  (J-l)2  for  all  k. 

Havi-  g found  the  approximate  locations  of  the  J additional  stationary  points  of  the 
Hd  surface,  we  now  iind  out  what  type  of  stationary  points  they  are.  Using  eq.  4.27, 

Ei 

we  find: 


a2V<^  = 


a K2  (1  - in  J) 


(4.47) 


for  those  coordinates  such  that  cp,  - cp  - K2  . Thus,  for  J >2,  we  have  a minimum  of 

* J 

Hd  in  J-l  of  the  coordinates.  For  the  remaining  case,  cp  = Kl,  cp  = K2. 

F K j 


52HE/azJ  = - a2K2  [ (n  + 1)  - 2 <n  in  p)/(p- D 
- (m  — 1 ) 2 exp  (- (n  U p)/ (u-1)  ) ] 


(4.48) 
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where  eq.  4.42  has  been  used  to  eliminate  n.  Direct  calculation  for  1 £p  £ " shows 
2 

that  6 II  * 0 for  all  p,  This  result  seems  to  suggest,  paradoxically,  that  all  J+l 
stationary  points  sre  minima. 


To  resolve  this  apparent  paradox,  we  must  be  considerably  more  careful  with  our 
coordinates.  We  seek,  in  particular,  at  least  one  linear  combination  of  the  such 

that  its  second  derivative  has  the  opposite  sign  at  the  stationary  point. 

Let  us  introduce  a variable  x,  0 < x * 1,  such  that: 


z.  = (1-x) 


J 

(C/J)  + (1/a)  fci  p.  - (1/aJ)  S an  p# 

K 1=1  *■ . 


(4.49) 


2.i  ~ 


(C/J)  + (1/a)  6n  p.  - ( 1/aJ)  s in  p. 


£■1 


(4.50) 


+ (x/(J-l)) 


(C/J)  + (1/a)  In  pv  - (1/aJ)  Stop. 

K 1=1  h 


j / k e J . 

For  ease  of  notation,  define: 

J 

3^/a  ■ (C/J)  + (1/a)  bt\  - (1/aJ)  S to  Pj 

Then  equations  4.49  and  4.50  become: 

“ (1-x)  (3^,/a) 


(4.51) 


(4.52) 


z,  - (3  ./a)  + (x/(J-l))'3k/a)  . (4.53) 

3 J K 

By  introducing  this  parameter,  we  achieve  several  results.  Note  that  when  x=0: 


zk  - 0k/a  '•  zj  - va 


(4.54) 
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These  are  the  solutions  for  optimal  detection  search  from  eq.  4.24.  When  x=l,  z, -0, 
and:  ‘ 


J-l 

z,  = C/(J-1)  + (1/a)  fin  p.  - (1/a (J-l) ) S fcp, 
j 3 e/k  * 


(4.55) 


The  latter  are  recognized  as  the  optimal  detection  search  allocations  in  J-l  cells,  when 
cell  k is  not  searched.  These  represent  the  class  of  whereabout  searches,  whose  aim 
is  to  locate  an  object  by  searching  all  cells  but  one,  then  guessing  the  position  on  the 
basis  of  the  search  outcome.  Of  this  class,  the  one  for  which  p.  = Max  (p  ) in  the 

J J 

unsearched  cell  is  called  the  "optimal  whereabouts  search,"  This  search  maximizes  the 
probability  of  correctly  guessing  the  object's  location  (reference  10), 

Finally,  for  x = (J-l)Vj/3k  we  find: 


zk  » g^/a  - (J-l)  /aJ 


(4.56) 


Zj  ■ gj/a  + (J-l)/aJ 


(4.57) 


These  are  identical  to  the  asymptotic  forms  of  the  positions  of  the  stationary  points  of  the 
HE  surface,  determined  in  equations  4,44  and  4.45, 

Thus,  by  use  of  the  parameter  x,  we  have  introduced  a set  of  J hyperplanes.  Each 
of  these  hyperplanes  connects  (a)  the  Type  1 stationary  point  corresponding  to  the  allo- 
cation for  optimal  detection  search;  (b)  one  of  the  J Type  2 stationary  points;  and 
(c)  one  of  the  J boundary  points  corresponding  to  a whereabouts  search.  We  now  investi- 
gate the  curve  formed  by  intersection  of  these  J hyperplanes  with  the  H surface. 

Using  equations  4.52  and  4.53,  we  can  put  H in  this  form: 


he  =(exp  - [ (C/J)  - (1/aJ)  k p*])l  - 3kx  [exp(0kx)-exp(-pkx/n)  ] 

\ (4.58) 

+ Cexp(|3kx)  + n exp(-gkx/n)]  £n  [exp(3kx)  + n exp  (-pkx/n)  ]?  . 


i 


By  computing  dll^/dx  = 0,  it  can  be  shown  that  an  equation  identical  to  eq.  4.42  is 
obtained,  with  p now  identified  as: 


exp  CO^x)  (n+l)/n] 


(4.59) 


Its  solution  is  again  the  curve  shown  in  figure  11,  and  the  asymptotic  form  leads  to: 


x = n / (n+1)  {(3k) 


(4.60) 


This  is  indeed  the  position  of  the  Type  2 stationary  point  as  determined  earlier. 

The  second  derivative  of  at  the  stationary  point  is: 

d2HE/dx2  ~ [ (p  &tp)/(p-l)  + p2  exp[-(p  Sm  p )/ (p-1)  ]- (p+1)  ] . (4.61) 

In  contrast  to  eq,  4,48,  this  is  negative  for  all  1 £ p < « . Thus,  along  the  curves 
generated  by  the  J hyperplanes,  H has  a max  at  the  stationary  point,  and  we  may 

safely  describe  the  J Type  2 stationary  points  as  saddle  points. 

Since  the  entropy  has  a max  at  the  type  2 stationary  point  along  the  x path,  and 
then  turns  downward  again,  we  may  ask  whether  it  later  reaches  a value  below  the 
minimum  at  the  optimal  detection  search  allocation.  In  fact,  it  does,  if  C is  large 
enough.  To  find  this  threshold  of  C , we  simply  equate  H (O)  = H_(l),  yielding: 


Jfc  J s [exp(0k)  + n exp(-0k/n)]  Cm  [exp(0k)  + n exp(-0k/n)] 

(4. 

- Pk[exp(0k)  - exp(-0k/n)] 

In  the  dual  limit  J >>  1,  jS^>>  in  (J-l),  this  equation  can  be  solved  approximately: 


(4.62) 


aCEQ  = S In  p^  - J In  pk  + J(J-l)  In  J . 


(4.63) 


l I 


The  minimum  of  this  over  the  set  J is: 


aCEQN  = p ln  pit  " J ln  PMIN  + J(J_1)  ln  J * (4>64) 


MIN 

For  any  C > C , there  is  a whereabouts  search  that  yields  more  expected  information 
liQ 

than  does  the  optimal  detection  search. 

We  can  now  give  a complete  description  of  the  expected  information  surface  in  allo- 
cation space,  with  the  total  search  effort  C as  a parameter: 

1.  For  C <C*  (eq.  4,22),  1^  has  no  stationary  points . The  extreme  values 
of  I occur  on  the  boundaries, 

2.  For  C*  < C < C£  (eq.  4.46),  and  for  J > 2,  I£  has  only  one  stationary 

point.  That  point  is  both  relative  and  absolute  maximum,  and  it  occurs  at  the  allocation 
of  search  effort  corresponding  to  optimal  detection  search. 

3.  As  C increases  through  the  series  C*  (k  - 1,  2,  --  J),  J new  stationary 

points  appear  on  the  I surface.  All  J are  saddle  points,  whose  coordinates  are  given 

approximately  by  equations  4.44  and  4.45,  The  one  maximum  point  remains  both  relative 
and  absolute  maximum. 

MIN 

4.  When  C reaches  C (eq,  4.64),  the  relative  maximum  is  no  longer  the 

by) 

absolute  maximum . The  new  absolute  maximum  of  I„  occurs  for  the  search  allocation 

E 

corresponding  to  the  optimal  whereabouts  search.  As  C increases  still  further,  new 
border  regions  are  added  to  the  surface,  for  which  1^,  exceeds  the  local  maximum. 

The  optimal  whereabouts  search  allocation  remains  the  absolute  maximum  of  the  surface, 
however. 

Finally,  it  is  interesting  to  compute  the  values  of  P and  of  the  expected  information 

MIN  . ...  D 

at  , where  the  crossover  occurs. 

EQ 

These  are,  for  optimal  detection  search  allocation: 


PD  = 1 " PMAX  J eXp!_JC/l  J)  ' 


(4.65) 


for  optimal  whereabouts  search  allocation: 


PD  ^ PMAX^  “ PMAX  (J-1)  exP(“J  &/1  > 


(4.66) 
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and,  for  the  information  in  either  case: 


^MAX  ” IE^'/IMAX  “ HE//H0  = ^MAX^O^  J eXp  ^ * (4.67) 


From  these  it  can  be  seen  that  for  even  modest  J (J  = 5,  say)  PQ  is  extremely  close  to 
1,  and  Ig  is  extremely  close  to  1^^,  by  the  time  the  crossover  occurs. 

We  return  now  to  the  case  in  which  the  are  not  equal.  This  corresponds  to 

Pollock's  example.  In  this  case,  the  allocations  (z^)  that  determine  the  extrema  of 

P , and  H-  are  given  by  the  solutions  of  equations  4.9  - 4.11,  They  are,  of 

D ND  E 

course,  not  identical. 

Using  the  Lagrange  multiplier  notation,  we  obtain  the  following  equations.  For 

4pd*#! 


akcpk.  ' 


all  k £ J ; 


(4.68) 


for  6 H = 0 : 
ND 


ak^k 


for  i H = 0 : 
E 


(In  cpk)  / B cpj’  - cpj  In  cp  / ^B  cPj^ 


XND  1 


(4.69) 


akcpk  £ln  V]*  " ln  B ‘Pj 


= \ 


E * 


(4.70) 


The  solution  to  eq.  4.68  is  straightforward  ar.d  well  known: 


(4.71) 


zk  s(&iakPk)/ak“ 


T,  J \ J 1 J 

^B  ( in  ajPj)/ajJ/ak  ^ (1/a  j )J  + C/ak 


Equations  4.69  and  4.70  are  transcendental  equations  and  appear  more  formidable, 
but,  In  point  of  fact,  only  eq.  4.70  is  so  formidable.  If  we  divide  eq.  4.69  by  and 
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;) 

}i] 


then  sum  on  the  index  k,  we  find  that; 


j / d \ j ^ ry  ^ 

(1/a.J  * ( D cpv.  to  cpv )/  2]  cp  , - ( 2 cpu  £ cp  . On  cpA 

k-1  K Ujc=i  K K/  j-l  JJ  Uk„!  * j = i 3 v 


(4.72) 


C^)2]  = 


= 0 . 


Since  the  l/a^  are  ail  nonzero  quantities  of  the  same  sign,  we  must  have: 
XND  " 0 * 


(4.73) 


Putting  this  back  into  eq.  4.69,  we  get  a much  simplified  equation.  Since  is  not 

zero,  we  must  have: 


J \ J 

In  cpk  - ( S^cp  j In  cpjj/ 


(4.74) 


Because  the  right  side  is  independent  of  k,  the  only  solution  to  this  set  of  equations  is: 


cpk  » const  - K , for  all  k £ J . 


(4.75) 


This  is  Identical  in  form  to  the  result  found  in  the  case  of  equal  . But  the  solution  for 

z*  is  somewhat  different: 
k 


*ND  , „ , 

z k = ( pk5 


/ok  - (2M  an  Pj ) /“•  j)/ak  LMi/ajjJ 


(4.76) 


C/ ( 1/a.J 
K j-l  11 

The  difference  between  the  solutions  (equations  4.71  and  4,76)  is  a function  of  the 

only,  and  does  not  depend  on  the  prior  probabilities  pk  , Specifically: 

* t.  ..m  r / *7  \ 3 T 


. __  *P  *ND 

Ak  * zk  ■ zk 


= (l/ak)  an  ay.  - ( J (fin  a j ) /a  j )/  ZJ  (1/a  j )J  . (4.77) 


Note  that  when  all  are  equal,  both 
the  ■ 0 for  all  k. 


*P  , 

7.  and 
K 


reduce  to  z (eq.  4.24),  and 

K 


Equations  4.71  and  4.76  can  be  used  to  calculate  the  optimum  allocation  of  search 
effort  in  Pollock's  modified  example.  The  analytic  solutions  are,  for  P^: 


Zj  = 1.497 

z2  » 11.334 
z3  = 17.169 

and,  for  H^: 

z^  = .996 

z2  = 11.126 
z = 17.878  . 

These  values  should  be  compared  with  those  found  in  section  3 to  be  the  maximum 
points  on  the  integer  grids:  (2,  11,  17)  and  (1,  11,  18),  respectively. 

It  is  significant  that  the  analytically  determined  maxima  of  PQ  and  are  fairly 

close.  With  cv1  large,  Pollock's  example  represents  a major  departure  from  a uniform 

probability  of  conditional  detection.  Nevertheless,  the  maxima  show  only  a small  dis- 
placement from  each  other,  although  they  show  a large  displacement  from  their  locations 


taking  Oj  = a = .693  for  all  j,  yields  z^  = 8.61,  z2  = 10,19,  z^  = 11.20).  Thus,  in 
practical  situations,  optimizing  lead  to  search  allocation  nearly  equal  to  that 

required  for  optimum  detection  probability, 


These  solutions,  too,  are  valid  for  large  search  efforts  only.  The  thresholds  for 
their  validity  are  determined  from  equations  4.68  and  4,75,  using  arguments  analogous 
to  those  previously  given.  The  results  are,  for  P^: 


* 

C 


t_y 

j=i 


ttjpj 


/a 


i ' (&<1/aJ)) 


Bn  [ap] 


min 


(4.78) 
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and,  for  H 


ND' 


....  S' 


£ ( bn  pd/a.  - ( £ (l/a,))  bn  Pmin 
j=l  J J v j=l  J ' 


(4.79) 


Of  greater  theoretical  significance  than  the  z^  are  the  relative  fractions  of  the 
total  search  effort  assigned  to  each  cell: 

= vc  • 

These  are  trivally  determined  from  equations  4.71  and  4,76: 

J 


^kP  = (1/ak  j^1(1/aj))  + (1/C)  [(  071  akpk)/ak 

- ( £ ( bn  a.,p.J /a.:)/ak  £ (1/a. 

\j«l  J ^ 3/  K j«i  J 


(4.80) 


CkND  " (1/ak  £ U/ttj))  + (1/0  [(  Bn  Pk)/ak 
- ( £ (bn  p j ) /a  j j/ak  £^(l/a..)J 


These  clearly  converge  to  the  same  asymptotic  limit  as  C” 
*ND  _ 

'j1 


*P  . *ND  , , v.  M . 

Ck«  ~ £ k®  * 1/ak  ^ (1/a- 


j=l 


(4.81) 


(4.82) 


No  simple  strategem  is  available  for  dealing  with  equation  (4,70),  and  closed -form 
solutions  for  the  optimum  search  allocation  do  not  appear  possible  in  the  case  of  H . 

Some  limited  statements  can  be  made  about  the  nature  of  the  solutions,  however.  If  we 
eliminate  from  any  two  equations  ineq,  4.70,  and  use  rather  than  , we  get: 
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[aipi 

expt-a.C.C) 

£n 

(pi  exp(-UiCiC)j 

- 

J 

/ p, 

j=l  3 

exp(-ajC jC) 

akpk 

exp (-akC kC) 

fan 

(pk  exp(_akckC)y 

/£  p, 
j»l  J 

exp (-a jC jC) 

This  can  be  further  manipulated  into  the  form: 


(ai/ak> 


ftPi/Pk)  exp(-C(akCk-aiCi))l  . to  [ S (Pj/Pk)  exp  (-C  (a  jC  j-akCk) 
L J L3!=1  (4.84) 

/ toi  £ ZMPj/pt)  exp  (-C  (ftjC  j"aiSi))J 

Since  the  left  side  of  this  equation  is  independent  of  C,  the  right  side  must  be,  as 
well,  if  the  equation  is  to  remain  valid  in  the  asymptotic  limit  C *♦  “ . 

Because  the  a,  values  are  given  and  independent  of  C and  because,  by  definition, 

k o 

0 < Cj  si,  then  , must  be  0(C  ) in  its  leading  time.  The  most  general  form  that 

satisfies  these  requirements  is: 

J 

Ci  = 1/a,  D (1/a.) 

1 L 1 j-1  3 J 


+ (fi/ai  C) 


(4.85) 


This  renders  eq.  4,47  independent  of  C,  to  all  orders,  but  still  makes  C of  order 


C . The  functions  f^  are  functions  of  the  ay  and  p ; they  must  satisfy: 

[ajP*  exp(-fi)J  Cn  J^Pj_  exp  (- f £ ))  / IyP  j exp(-fj)J  = 
and 


v'  (f4/a4>  « 0 
j-1 


(4.86) 


(4.87) 
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An  explicit  solution  for  the  f of  eq.  4,86  is  no  more  feasible  than  an  explicit  solution 

for  the  cpj  of  eq.  4.70.  But,  by  the  artifice  of  eq.  4.85,  we  have  separated  out  the 

asymptotic  part  of  the  solution.  This  leads  to  the  very  important  Theorem  IV: 

Theorem  IV:  Under  the  assumptions  of  section  2.1,  the  allocations  of  the  fraction 
of  search  effort  among  the  J search  cells  that  produce  stationary  points  of  Pp,  H^, 

and  H have  a common  asymptotic  limit  as  the  amount  of  search  effort  C approaches 

infinity.  That  limit  is: 


C 


(1/a  j 


) 


(4.88) 


The  final  topic  to  address  in  this  section  is  whether  the  extrema  are  maxima  or 
minima  in  the  case  of  differing  a . This  is  easily  answered  in  the  case  of  and 

■Hj  — . Combining  equations  4, 25  and  4. 68,  we  get: 


62pd  ~ -Jtp(ak  + aj)  for  all  k e J . (4.89) 

Ip,  a^,  and  are  all  positive,  and  P^,  therefore,  always  has  a maximum. 

Similarly,  when  we  combine  equations  4.26  and  4.75: 

62Hnd  ~ -2akaJ/J2  - (a2  + a2)  [ (1/J)  - (1/J2)]  (4.90) 

This  is  always  negative  for  positive  integer  J;  therefore,  H^,  too,  always  has  a 
maximum. 

. 2 

No  general  rule  appears  appropriate  for  application  to  6 H^.  Though  analytical 

results  are  not  forthcoming,  we  have  already  seen  in  the  numerical  examples  that 
maxima,  minima,  and  saddle  points  are  all  possible  as  the  stationary  points  of  the 
surface. 
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5.  A MA LYSIS- -INFINITESIMAL.  SEARCH  EFFORT 


We  turn  now  to  the  opposite  limit,  in  which  the  total  amount  of  search  effort  approaches 
zero.  In  particular,  vve  consider  the  case  In  which  an  infinitesimal  amount  of  search 
effort  Az^  is  applied  in  only  one  cell,  k £ J.  We  then  apply  the  usual  limiting  processes 

of  calculus  to  compute  the  rates  of  change  of  H , and  as  a result  of  searching 

In  cell  k. 

The  quantities  A and  D,  defined  in  equations  4.1  and  4,2  are,  for  search  in  a 
single  cell: 

A - -H0  - (Pk  In  Pk)[l  - exp(-akAzK)]-akAzkpk  exp{-akAzp  ^5,1) 

B - 1 - pk  [1  - exp(-Vzk)] 


where  H has  been  defined  in  eq,  2,17.  When  A and  11  are  expanded  to  first  order 
in  Az^  and  substituted  in  equations  4.3,  4.4,  and  4.5,  we  obtain: 

PD  = (A  VW  + 0(Azk} 

HND  = H0  + (AVakPk(H0  + ln  V 
+ 0(Azk) 

HE  ‘ H0  + AZkak<Y  1,1  pk  ' 

The  nee : 


(5.3) 

(5.4) 

(5.5) 


lim  (Pd-P0)/Azr  * clPD/dzk  - akpk  (5.6) 

Azk-0 

lim  (hnd"H0)  //Azk  ""  ^ND^^^k  = akpkIH0  + ln  pk*  (5*7^ 
Azk-° 

(Hi.rHo)/Ak  - dVdzk  " akpk  ln  pk 


lim 

Azk-° 
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(5.8) 


Our  basic  concerns  will  .,e  with  the  magnitude  of  these  rates  of  change,  and  with  the 
extreme  vrlues  of  the  magnitudes  over  the  set  J;  we  shall  determine  these  extreme  values. 
First,  however,  some  observations  about  the  signs  are  significant.  Recalling  that 
0 & a <»  , and  0 £p  £1  for  all  k £ J,  we  note  immediately  that: 

N K 


dPD/dzR  s 0 ; all  k 6 J (5.9) 

dH  /dz.  £ 0 f all  k € J . (5.10) 

E K 

The  corresponding  result  for  dHND/dzk  is  less  obvious.  However,  the  reader  may 
persuade  himself  by  selected  numerical  examples  that  dH^/dz^  can  be  either  positive 
or  negative,  depending  on  the  set  of  the  prior  probabilities  p and  on  the  searched  cell, 

J 3 

k.  (Alternatively,  we  may  note  that  (l/o^)  dH^/dz^  = 0,  implying  that  the  signs 

k_i 

of  dH..„/dz,  are  not  the  same  for  all  k. 

ND  k 

The  observation  concerning  dPD/dzk  is  scarcely  surprising.  But  those  related  to 
the  entropies  are  significant: 

Theorem  V:  For  any  search  operation  conducted  according  to  the  assumptions 
of  section  2.1,  the  expected  entropy  never  increases  (and  the  expected  information  never 
decreases),  regardless  of  the  cell  chosen  for  search.  The  entropy  that  is  conditioned  on 
nondetection  of  the  target,  however,  may  either  increase  or  decrease,  depending  on  local 
conditions . 

It  is  worth  pointing  out  that  the  rates  of  change  of  P and  H„  depend  entirely  on 
local  conditions,  i.e. , on  the  prior  probability  in  the  cell  being  searched:  the  rate 
of  change  of  H^,  on  the  other  hand,  depends  globally  on  the  entire  prior  ensemble, 
through  Hq.  It  is  therefore  conceivable  that  for  two  distinct  prior  ensembles,  the  rates 
of  change  of  could  vary  in  both  magnitude  and  sign,  even  though  the  individual  cells 

chosen  for  search  In  the  two  cases  had  identical  prior  probabilities. 

We  now  determine  whether  the  results  obtained  in  section  4 still  apply  in  the  limit 
Az^TO.  Specifically,  does  the  allocation  that  maximizes  P^  also  maximize  and 

produce  an  extremum  (either  maximum  or  minimum)  in  H ? 
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We  have  already  noted  that  in  the  case  in  which  the  a are  different  in  different  cells, 

K 

no  results  of  useful  generality  were  obtained.  We  therefore  restrict  ourselves  to  the  case 
all  kC  J,  and  --  for  simplicity  in  this  section  --let  a = 1.  Then,  the  rates  of 

change  for  P^,  H__,  and  1 L are: 

D ND  b 

Vpk(Ho  + ln  pk)(  and  pk  ln  pk  ' 


respectively. 


Let  us  assume  that  the  cells  are  numbered  in  order  of  nonincreasing  magnitude  of 

p,  ; p 2 P„  * - - - 2p.  20.  By  this  device,  we  make  sure  that  infinitesimal  search  in 
k *1  2 ‘J 

cell  1 will  produce  the  highest  rate  of  increase  in  P^,  and,  hence,  maximize  P^  in  the 
limit  Az^O.  We  then  investigate  the  extrema  over  the  set  J for  the  rates  of  change  of 

hnd  and  V 


H. 


ND 


We  noted  earlier  that  the  sign  of  dll^^/dz^  might  lie  either  positive  or  negative, 

depending  on  the  prior  ensemble  and  on  the  searched  cell,  k.  This  general  observation 
can  be  made  somewhat  more  precise.  Some  preliminary  observations: 

1,  For  a given  prior  ensemble,  !I^  Is  a fixed  positive  constant. 

2.  The  factor  p^  is  monotonically  non  increasing  with  the  index  k because 
of  the  assumed  ordering.  It  is  always  >0. 


3.  The  factor  II  f lap  is  munotonieally  nonincreasing  with  the  index  k 

w K 

from  some  Initial  value,  and  mnv  approach  -ro  for  a p that  is  small 

K 

enough.  Whether  the  maximum  value  of  ll^  i-  ln  Pj.  is  positive  or 
negative  remains  to  lx?  determined. 


We  shall  later  prove  that  the  max  of  11  4 In  p Is  in  fact  nonnegative  for  any  prior 

w K 

ensemble.  For  the  moment,  we  assume  that  this  is  true  and  derive  the  consequences. 

I3y  the  assumption,  there  is  a range  of  k,  1 < k s K •'],  for  which  11^  + In  0, 
Then,  in  this  range,  p (H  + ln  p ) is  a positive,  monotonically  nonincreasing  function 

K U K 

of  k,  because  it  is  a product  of  two  positive,  inonotouically  nonincreasing  functions  of  k. 

-48- 


In  tills  range: 


P1(Hq  + In  px)  2 Pk(HQ  + In  pR)  (5.11 

for  1 <k  * K.  Outside  this  range  H ^ + In  < 0 , and  the  monotonicity  of  the  product 

is  no  longer  assured.  Since  the  product  is  then  negative,  however,  it  is  obviously  lees 
than  p1(HQ  + In  pp.  Thus,  if  p1  * p^  for  any  1 <k  sj,  then  P1(HQ+  lnp^* 

pk(H0  + ln  Pk) for  any  1 < k <f  • 


Now  we  prove  the  assumption.  We  need  to  show  that  for  any  prior  distribution,  there 
is  at  least  one  pk  such  that  HQ  + ln  p^  is  nonnegative.  This  is  easily  done  by  noting 

that: 


u “ 

H_  + in  p.  a H Pj  (-  In  Pj  + fin  Pk)  = ^ Pa  ( 3n(Pj,/P.i)  ) 
° K j = l J J j = l ■> 

IS  p.  is  Max  p , then  each  term  in  the  sum  is  nonnegative,  and 

J J 


(5.12) 


H , + p iO. 
0 ^niax 


(5.13) 


One  final  comment  about  dH^/dz^.  We  have  seen  that,  for  the  largest  p^  in  J, 

d^ND^Zk  * 1°  addition,  for  sufficiently  small  values,  dH^D/dzk~,0  . This  sort 

of  behavior  suggests  that  for  some  intermediate  value  of  p^,  dH^/dz^  has  a relative 

minimum  < 0 (which,  ln  fact,  is  an  absolute  minimum,  too).  This  Is  best  Illustrated  by 
a specific  model. 

Consider  a countably  Infinite  set  of  search  cells,  ln  which  the  (ordered)  prior 
probabilities  are: 

pk  = (1  - y)  Yk  1 (5.14) 

00 

Clearly  p = 1 when  the  sum  converges  (0  < y < 1). 
k=l  k 
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This  model  is  a useful  tool  for  investigating  some  situations  involving  finite  J, 
General  analytical  results  can  be  obtained  with  the  infinite  model,  yet  that  model  can 
be  made  to  approximate  a finite  model  reasonably  well  by  truncation  of  the  series 

CO 

after  J-l  terms  and  lumping  all  the  remaining  probability  J2  P,  together  as  the 

k=J 

Jth  term.  For  Y < 1/2,  the  total  residual  probability  in  (J,  “)  is  less  than  the  J-l 
term,  and  the  decreasing  ordering  is  preserved. 

For  this  model,  with  a = a = 1,  we  can  show  that: 

dHNn/dzk  = (1/y  ) 1 y | exp  {-k |foi  y | ) [ 1 - k(l-y)]  , (5.15) 

This  exprescion  is  positive  for  small  k and  negative  for  large  k.  The  crossover 
point  is  given  by: 

k+(y)  - 1/d-y)  (5.16) 

which  is  monotonically  increasing  from  0 to  “ as  Y goes  from  0 to  1 . Thus,  by 
choice  of  the  parameter  Y , we  can  make  an  arbitrarily  large  number  of  cells  have 
positive  values  of  dll^^dz^  . 

Treating  k as  a continuous  variable  and  differentiating,  we  find  that: 


a(dHND/dzk)/dk  = 0 for  k = k*  , (5.17) 

where: 


k*(y)  - 1/d-Y)  + -1/  ( |r--i  y|)  . (5.18) 

This  can  be  shown  to  be  a minimum  of  dll  /dz  . Like  k4(y),  k *( Y)  is  also  a monotoni- 

IN  1..J  K 

cally  increasing  function  of  Y , going  from  1 to  + « as  Y goes  from  0 to  1 , The 
position  of  the  minimum  can  also  be  situated  at  arbitrarily  large  values  of  k,  depending 
on  the  choice  of  Y , although  clearly  the  choices  of  k (Y)  and  k*(Y)  are  not  Independent 
of  each  other. 


An  analysis  of  the  relative  magnitudes  over  [k}  of  p^  In  is  closely  related  to  a 

problem  already  solved  by  Browning  (reference  11).  Browning  treats  a slightly  more 
complicated  case,  in  which  the  amounts  of  search  effort  are  finite  ("looks")  rather  than 
infinitesimal.  That  makes  his  proofs  more  complex  than  the  present  case  requires,  but 
it  does  not  alter  their  validity  or  their  relevance  to  the  infinitesimal  limit. 

We  shall  simply  state  the  necessary  results  with  qualitative  justifications.  The 
reader  may  consult  Browning  for  analytical  proofs. 

We  shall  consider  the  rate  of  change  of  the  expected  information,  rather  than  the 
expected  entropy:  dl  /dz,  = -p  In  p *0.  Figure  12  shows  the  function  y(p)  = -p  In  p 

ti  K K K 

in  the  domain  0 <p  si.  This  curve  is  asymmetrical,  having  a maximum  at  (1/e,  1/e), 

J 

We  must  visualize  along  this  curve  the  set  of  points  y ■ y(p  ),  subject  to  p = 1 , and 

iv  * 1 , 1C 

k=l 

to  the  ordering  p^  1 P2  s P3  * “ ” “ * Pj  0. 

Browning  has  shown  that  the  following  cases  should  be  distinguished: 

1.  1/e  * p1  . In  thic  case  all  the  y^  are  on  the  increasing  part  of  the 
curve  y(p);  therefore,  the  ordering  of  the  p^  guarantees  the  ordering  of 

to  yk:  y,»y2*y3»- - -*yk*°.  j 

2.  1/e  <p„  . In  this  case,  which  can  still  satisfy  the  constraint  Vp,  si,  y. 

L . , K X 

k*=l 

is  on  the  decreasing  segment  of  y(p);  therefore,  y2  2 Yj  • 

3.  < 1/e  . This  case  is  problematical  and  is  best  resolved  by  numerical 

calculation. 

Before  turning  to  relevant  calculations,  we  must  resolve  the  question  whether  y could 

u 

be  the  largest  of  the  y^  under  any  conditions . Reference  to  the  figure  shows  this  is 
obviously  not  possible  if  p2  £l/e  , since  then  £ y2  . On  the  other  hand,  if  both  p^ 
and  p are  * 1/e  , then  the  maximum  value  possible  for  p.  is  (1-2/e)  <l/e,  attained 
when  p1  = p2  = 1/e.  There  again  y3  <y^,  y2  • Any  increase  in  p^  and  p2  above  1/e 
can  occur  only  at  uie  expense  of  p^,  which  further  decreases  y^  at  a rate  faster  than 
either  y^  or  y2  because  of  the  steeper  slope  for  p < 1/e.  Thus,  there  are  no  conditions 


y 


.3 
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FIG.  12;  GRAPHICAL  ILLUSTRATION  OF  THE  FUNCTION: 

y = --  p (Vi  p 


for  which  y = Max  y ; we  need  therefore  consider  only  cells  1 and  2 when  allocating 

vj  , K 

k 

search  effort  to  produce  the  maximum  rate  of  information  increase. 

Figure  13  shows  the  relevant  portion  of  the  Pj,  p2  plane.  The  boundary  curves 

J 

p^  = 1),  Pj  = p2  (the  limiting  case  of  the  assumed 

ordering  px  * p2),  and  y(pl>  = y(p2>.  In  region  I,  y1  >y2;  in  region  II,  y 2>Vl  • 

It  is  important  to  note  that  the  line  p^  + p2  = 1 is  a boundary  of  region  II  only.  This 

implies  that  for  the  case  J=2,  y0  > y^,  except  for  the  endpoint  y2(0)  = * 0.  When 

there  are  only  two  search  cells,  the  maximum  rate  of  increase  of  information  is  always 
, attained  by  searching  the  cell  with  the  lower  prior  probability,  and  the  minimum  rate  of 
I increase  of  information  is  always  attained  by  searching  the  cell  with  the  higher  prior 
I probability.  This  is  in  agreement  with  the  results  obtained  for  the  asymptotic  case 
C -*  “ . 


are  p^  + p2  = 1 (the  limiting  case  of  £ 


When  J > 2,  however,  a new  feature  enters.  There  is  a significant  region  in  the 
P1  , p2  plane,  where  the  maximum  rate  of  information  increase  is  attained  by  searching 

in  the  cell  with  the  second  highest  prior  probability.  The  cell  with  the  highest  prior 
probability,  which  is  searched  to  maximize  the  growth  of  PD»  does  not  represent  either 

a maximum  or  a minimum  in  the  rate  of  information  growth.  The  results  of  the  asymptotic 
analysis,  therefore,  do  not  carry  over  into  the  infinitesimal  limit,  and  we  cannot  assert 
universally  that  the  allocation  that  maximizes  PD  also  produces  an  extremum  in 

He  (or  IE). 

Reference  to  our  infinite -number -of -cells  model  will  illustrate  some  of  these  points. 
For  that  model: 

dlg/dz^  = - (1/y)  (1-y)  exp  (-k|&i  y I ) < 1- y ) + (k-1)  i’/i  y]  r (5.19) 


which  is,  of  course,  always  ^ 0 for  any  k.  Again,  treating  k as  a continuous  variable, 
we  find  that  dlg/dz^  has  a maximum  at: 

k|(y)  =>  1 - Oi(l-y)  ]/  Cm.  y - 1/  Hn  y , (5.20) 
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p,  : Prior  probability  In  cell  1 


FIG.  13:  DELINEATION  OF  REGIONS  IN  THE  SPACE 
OF  PRIOR  PROBABILITIES 
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The  behavior  of  k*(v)  differs  significantly  from  that  of  k*  (Y)  . As  y goes  from 
E ND 

0 to  1 , k*(Y)  starts  at  1,  increases  slightly  to  a maximum  of  —1,5422  at  Y®  . 351575 

then  decreases  monotonically  toward  Of  the  positive  integers,  only  k = 1 or  2 are 
closest  integers  to  this  curve  for  any  Y , as  suggested  by  our  previous  discussion.  Thus 
only  cells  1 or  2 need  be  considered  candidates  for  Max  {dl_/dz.  ] . 

k E k 

The  results  of  this  section  are  summarized  as  follows.  For  ordered  probabilities 
P1  ip2  * P3  ^ ‘ ' " * PJ  * ° 1 

a.  The  rate  of  change  of  is  always  positive.  It  has  a maximum  for  search 
in  cell  1 and  a minimum  for  search  in  cell  J. 

b.  The  rate  of  change  of  HND  may  be  either  positive  or  negative.  It  has  a 

nonnegative  maximum  for  search  in  cell  1,  and  a negative  minimum  for  search 
in  some  intermediate  cell  m:  1 < m < J , 

c.  The  rate  of  change  of  I is  always  positive.  If  J=2,  it  has  a maximum 

h 

for  search  in  cell  2 and  a minimum  for  search  in  cell  1.  If  J > 2,  then,  either 
(1)  it  has  a maximum  for  search  in  cell  1 and  a minimum  for  search  in  cell 
J,  or  (2)  it  has  a maximum  for  search  in  cell  2 and  a minimum  for  search  in 
either  cell  J or  cell  1. 


6.  SUMMARY 


Y 


The  relation  between  search  theory  and  information  theory  remains  complex. 

Previous  attempts  to  attack  the  problem  have  focused  on  only  a narrow  part  of  the 
relationship.  Their  narrow  focus  and  the  lack  of  clear  definition  of  "Information"  or 
"entropy"  created  an  unwarranted  impression  that  the  body  of  work  was  contradictory, 
if  not  incorrect.  It  is  not. 

The  work  reported  in  the  earlier  references  is  both  correct  and,  when  viewed  from 
a broader  perspective,  consistent.  The  individual  results  reported  earlier  have  been 
useful  in  establishing  this  broader  perspective . 

In  the  present  paper,  some  of  these  earlier  results  have  been  rederived  from  a 
different  viewpoint.  Other  new  results  have  been  added.  Taken  together,  these  findings 
are  enough  to  allow  us  to  sketch  a coherent,  though  still  incomplete,  picture  of  the 
relation  between  search  theory  and  information  theory. 

Equally  important,  we  can  now  identify  the  gaps  somewhat  more  clearly  and  can 
direct  future  research  toward  those  gaps  with  the  confidence  that  there  is  an  underlying 
body  of  theory  to  be  completed,  rather  than  a mere  collection  of  isolated  observations. 

Before  summarizing  the  findings,  it  is  important  for  us  to  recapitulate  the  assumptions. 
These  are  the  most  critical  assumptions  of  our  model,  which  are  used  consistently 
throughout  this  paper:  There  Is  a single  target;  It  is  stationary;  no  false  detections 
occur;  and  the  detection  process  Is  governed  by  an  exponential  detection  function. 

Relaxation  of  any  of  these  assumptions  would  add  considerable  complexity  to  the  theory 
and  might  change  the  conclusions  significantly.  Within  the  framework  of  these  general 
assumptions,  this  paper  has  examined  subcases  In  which  (a)  the  amount  of  search  effort 
is  either  very  large  or  very  small,  and  (b)  the  conditional  detection  probability  is 
either  uniform  or  variable  throughout  the  search  cells. 

Summary  of  Findings 

a.  Uniform  conditional  detection  probability . 

1.  The  allocation  of  search  effort  that  maximizes  the  posterior  entropy 

conditioned  on  nondetection  also  maximizes  the  probability  of  detection 

Pjy  This  result  was  first  established  by  Barker  and  was  confirmed  by  the 

present  author  in  both  the  large  and  small  search  effort  limits.  Since  Barker's 

proof  is  not  restricted  to  these  limiting  cases,  the  result  is  valid  in  the  Inter- 
mediate range  as  well. 
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2.  In  the  large  search  effort  limit,  the  allocation  of  search  effort  that  maximizes 
the  probability  of  detection,  p also  produces  a local  extremum  in  the  expected 

information  1^  . For  two  search  cells,  this  extremum  is  a local  minimum  of  1^. 

For  any  number  of  search  cells  greater  than  two,  this  extremum  is  a local  maximum 
of  Ig.  Tliis  is  consistent  with  Mela's  calculations.  (Although  the  number  of  looks 

is  only  equal  to  J in  Mela's  examples,  this  still  corresponds  to  the  large  search 
effort  limit,  since  C*  = 0 by  eq,  4,22. 


3.  The  local  extremum  of  I is  not  necessarily  the  global  extremum.  For  an 

11  MIN 

intermediate  range  of  C,  C*  < C < C,,-  , the  local  extremum  is  also  the  global 
MIN  B = oQ 

extremum.  For  C_,_  <C,  there  is  always  at  least  one  whereabouts  search  (and 

there  may  be  as  many  as  J),  corresponding  to  an  edge  point  of  the  I£  surface,  which 

produces  more  expected  information  than  the  optimal  detection  search.  In  practical 

cases  the  distinction  appears  unimportant,  since  P ~ 1 and  I ~ I , 

Jvl  ^ ***  b **  IW/V  A 

when  G reaches  CL,*  . 
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4.  In  the  small  search  effort  limit,  the  exact  correspondence  between  the  maximum 
of  and  the  extremum  of  I„  breaks  down.  We  achieve  the  maximum  rate  of 
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increase  of  I by  searching  in  the  cell  that  has  either  the  largest  or  second  largest 

rate  of  increase  of  detection  probability.  In  the  two -cell  case,  this  is  at  least  con- 
sistent with  the  large  search  effort  limit.  For  more  than  two  cells,  however,  this 
is  a distinct  phenomenon,  constituting  another  demonstration  that  the  connection 
between  optimum  search  and  optimum  information  gain  is  not  universal.  Working 

with  P„.  and  I directly,  rather  than  with  rates  of  change,  Browning  has  obtained 
u b 


comparable  results  in  the  intermediate  search  effort  region  for  cases  ]~2  and  J=3, 
There  remains  an  open  question:  precisely  how  the  transition  between  the  two 
limits  takes  place  and  whether  the  C*  determined  in  equation  4.22  is  in  fact  the 
minimum  value  fer  which  the  large  search  effort  limit  is  valid. 


5.  The  rate  of  change  of  I is  always  positive  for  any  search  allocation.  The 
rate  of  change  of  can  be  either  positive  or  negative,  depending  both  on  the 

search  allocation  and  on  the  ensemble  of  prior  probabilities  in  the  search  cells. 


b.  Variable  conditional  detection  probability, 

1.  In  the  large  search  effort  limit,  both  and  have  local  maxima.  The 

search  allocations  leading  to  those  maxima  are  not  the  same,  as  is  demonstrated  by 
eq.  4,77,  Phis  finding  is  consistent  with  Barker’s  second  theorem. 
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2.  In  the  large  search  effort  limit,  no  closed-form  solution  for  the  stationary 

points  of  I can  be  determined.  Even  the  nature  of  the  stationary  points  cannot 
h 

be  established  analytically,  but  the  numerical  examples  have  shown  that  maximum, 
minimum,  and  saddle  point  are  all  possibilities.  In  the  asymptotic  limit  (C  ), 
the  positioning  of  the  stationary  points  of  ?D,  H^,  and  Ig  all  converge  to  a 

common  limit: 

J 

(C/a  . ) (1/  S (1/a  , ) ) 

1 j"l  : 

3.  In  the  small  search  effort  limit,  no  systematic  relationships  have  been 

established  among  the  maxima  of  P , H , and  I . 

D ND  E 

4.  That  the  rate  of  change  of  I„  is  positive  for  any  search  allocation,  even 

hi 

when  the  conditional  detection  probabilities  vary,  remains  valid. 

Interpretation 

Only  in  the  case  of  uniform  conditional  detection  probability  do  we  have  enough 
results  to  attempt  some  interpretation.  There,  however,  some  simple  statements  can 
do  much  to  dispel  the  legacy  of  past  confusion. 

First,  using  Barker's  theorems  and  our  present  results  on  the  rates  of  '-hange  of 
Pp  end  , wfi  have  noted  that  identical  search  policies  produce  the  maximum 

increases  of  both  detection  probability  and  posterior  conditional  entropy.  Like  all 
entropies,  is  a negative  information.  Thus,  maximizing  its  rate  of  increase 

is  equivalent  to  maximizing  the  rate  of  decrease  of  an  information.  The  information 
that  is  being  decreased  is  chat  contained  in  the  er.  ;mble  up  to  the  time  of  detection: 
the  information  about  the  target  location  that  is  expressed  by  the  prior  probabilities 
and  that  changes  as  the  search  progresses.  With  this  view,  we  can  state  one  general 
conclusion,  already  correctly  anticipated  by  Richardson: 

• The  search  policy  that  maximizes  the  probability  of  detection  is 
the  one  that  uses  up  the  information  contained  in  the  prior  ensemble 
at  the  maximum  rate. 

Second,  we  have  shown  in  this  paper  tnat  in  most  cases  the  optimum  policies  for 
Pn  and  I are  equivalent.  Here,  however,  information  is  being  created  rather  than 

used  up,  Because  this  is  expected  information,  visualizing  the  overall  process  of 
information  creation  may  be  harder.  Consider  a repeated  search  in  one  of  the  J cells. 

If  that  cell  has  a prior  probability  of  containing  the  object,  its  initial  self  information 
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is  In  p , At  each  stage  of  the  search,  if  detection  does  not  occur,  p decreases; 

K N 

consequently,  the  actual  self  information  of  the  cell  also  decreases.  The  probabilities 
of  the  other  J-l  cells,  however,  increase  simultaneously  (because  of  the  constraint 

J 

£ p.  = 1).  as  do  their  values  of  self  information.  If  detection  does  occur  in  the  kth 

J-l  J 

cell  after  some  number  of  unsuccessful  searches,  Its  self  information  undergoes  a 
sudden  increase;  the  self  information  of  the  other  cells  then  drops. 

When  we  speak  of  ensemble  averages,  as  we  have  thrrughout  this  paper,  the  competing 
effects  of  these  information -creating  and  -destroying  processes  are  combined,  with  a 
probabilistic  weighting.  Until  the  time  of  detection,  the  loss  of  information  caused  by 
failure  to  detect  In  the  cell  being  searched  is  weighed  against  the  inferred  gain  in 
information  in  the  other  cells.  The  net  change  oi  information  may  be  either  positive 
or  negative,  as  we  noted  in  our  discussion  of  the  rate  of  change  of  . If  detection 

occurs,  the  positive  and  negative  Information  contributions  from  that  process  must 
also  be  added  in,  with  the  proper  weighting.  Then,  as  indicated  by  the  rate  of  change 
of  I,,  , the  overall  information  gain  is  net  positive. 

This  interpretation  of  the  creation  and  destruction  of  Information  leads  to  our  second 
general  conclusion: 

• In  a wide  range  of  practical  cases,  the  search  policy  that  maximizes 
the  detection  probability  is  the  one  that  creates  expected  Information  at 
the  maximum  rate.  We  achieve  that  maximum  rate  of  creation  by  using 
the  existing  information  of  the  prior  ensemble  as  fast  as  possible,  thereby 
gaining  an  early  detection,  with  its  concomitantly  large  increase  of 
expected  Information. 

Except  fur  the  anomalous  cases,  these  two  statements  contain  the  essence  of  the 
interpretation  of  optimum  search:  We  consume  existing  information  at  the  maximum 
rate  in  the  expectation  of  gaining  still  more  information,  also  at  the  maximum  rate. 

The  anomalous  cases  are  of  three  types:  (1)  the  case  of  two  search  cells,  in  which 
maximum  Information  gain  Is  achieved  by  search  in  the  cell  yielding  the  lower  detection 
probability,  regardless  of  the  amount  of  search  effort  applied  and  regardless  of  the 
probabilities  in  the  prior  ensemble;  (2)  the  case  of  three  or  more  search  cells,  in  which 
maximum  information  gain  Is  achieved  by  whereabouts  search  rather  than  optima!  detec- 
tion search  for  very  large  amounts  of  search  effort;  and  (3)  the  case  of  three  or  more 
search  cells,  in  which  maximum  information  gain  Is  achieved  by  searching  the  cell  that 
yields  the  second  highest  detection  probability,  but  only  for  small  amounts  of  search 
effort,  and  only  for  special  regions  in  the  space  of  prior  probabilities. 
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The  anomaly  in  the  first  two  cases  appears  to  be  caused  by  the  two  competing 
methods  of  increasing  information:  by  actual  detection  in  the  searched  cells,  or  by 
inference  that  the  target  is  in  the  other  cell,  based  on  failure  to  detect  in  the  searched 
cells.  These  two  mechanisms  are  at  work  regardless  of  the  number  of  cells.  When 
there  are  only  two  cells,  however,  failure  to  detect  In  one  leads  to  a large  increase 
In  the  probability  that  the  target  is  In  the  other,  and,  hence,  much  more  information 
about  the  location  of  the  target. 

This  mechanism  of  inferred  information  is  dominant  in  the  two-cell  case,  ao  that 
actual  detection  is  not  necessary  to  achieve  the  maximum  information  gain,  even  with 
small  amounts  of  search  effort.  The  two-cell  search  is  thus  an  exemplary  case  of 
the  whereabouts  search.  For  three  or  more  cells,  the  inferred  information  mechanism 
of  Ur?  whereabouts  search  is  not  normally  dominant;  It  becomes  important  only  at 
large  values  of  the  search  effort. 

In  the  second  case,  the  anomaly  appears  to  be  more  dependent  on  the  mathematical 
structure  of  the  ensemble  average  information  --  on  the  form  p In  p.  In  those  areas 
of  the  (Pj,  p2)  plane  where  the  anomoly  exists  (figure  13),  p^  is  large,  and  the  amount 

of  self -information  to  be  gained  by  a detection  in  cell  1 is  small.  Detection  in  cell  2, 
although  less  likely,  provides  more  self-information.  When  incorporated  in  the  ensemble 
average,  the  greater  but  less  likely  expected  information  contribution  from  cell  2 --  p2 

In  p2  --  dominates  the  smaller  but  more  likely  contribution  from  cell  1 — p^  In  p^. 

Future  Research 

Finally,  we  need  to  set  out  the  areas  in  which  further  research  may  prove  most 
fruitful . 

The  principal  case  treated  here  (uniform  conditional  detection  probability)  has  broad 
practical  applications.  Its  basic  structure  has  been  laid  out,  and  only  minor  holes  remain. 
Clearly,  there  are  unsolved  questions  relating  to  the  transition  between  the  small  and  large 

search  effort  limits  in  the  case  of  r . These  are  Important  theoretical  questions,  but 

1-0 

les3  so  in  a practical  sense. 

The  major  extensions  needed  for  use  in  real  world  search  theories  are,  first,  to 
false  targets,  second,  to  multiple  targets,  both  real  and  false,  and  finally,  to  moving 
targets.  Each  of  these  cases  is  of  immense  practical  significance,  and  any  theory  that 
does  not  Include  them  can  have  little  claim  to  completeness. 

Relaxation  of  the  assumption  of  exponential  detection  functions,  and  any  further  con- 
siderations of  the  case  of  varying  conditional  detection  probabilities  can  be  deferred  on  the 
grounds  ni  lower  practical  priority,  despite  their  considerable  theoretical  interest  and 
complex  mathematical  challenge. 
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